Random spread on the family of small- world networks 
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We present the analytical and numerical results of a random walk on the family of small-world 
graphs. The average access time shows a crossover from the regular to random behavior with 
increasing distance from the starting point of the random walk. We introduce an independent step 
approximation, which enables us to obtain analytic results for the average access time. We observe 
a scaling relation for the average access time in the degree of the nodes. The behavior of average 
access time as a function of p, shows striking similarity with that of the characteristic length of the 
graph. This observation may have important applications in routing and switching in networks with 
large number of nodes. 

PACS numbers: 05.40.Fb, 05.40.-a, 05.50.+q, 87.23. Ge 
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I. INTRODUCTION 

The small-world network exhibits unusual connection 
properties. On one hand it shows strong clustering like 
regular graphs and on the other hand it shows very small 
average shortest path between any two nodes like ran- 
dom graphs. Watts and Strogatz have proposed a simple 
model to describe small- world networks W. The model 
gives a prescription to generate a one parameter fam- 
ily of graphs, ranging from the highly clustered (regular) 
graphs to the random graphs. 

Various properties of this model have been stud- 
ied 1^, ||, ||, |). The spread and percolation properties 
investigated in Refs. j|, ||, |], |), deal with the spread 
of information (disease) along the shortest path in the 
graph or the spread along the spanning tree. 

In this paper, we study random walk on the family of 
small-world networks. Such a random walk corresponds 
to random spread of information on the network. In 
any realistic application of the spread on a graph, we 
expect the spread to be somewhere in between the two 
extremes viz. shortest path and the random walk. e.g. 
In Milgram's experiment jfj], which studies the connec- 
tion properties of social networks, the path of a letter 
from a randomly chosen point to a fixed target is traced. 
The only condition imposed on the transfer of letter was 
that, the letter should be given to the person whom the 
sender knows by first name. The path followed by such 
a letter would have both random as well as shortest path 
elements in it. Another example is the path of inter- 
net protocol packet which follows a similar algorithm for 
forwarding the packet 

The determination of shortest path between two nodes 
typically requires 0(n 3 ) operations ||, where n is the 
number of nodes. This process becomes prohibitively ex- 
pensive as n becomes very large. Examples of such net- 
works are social networks, telephone networks, internet 
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etc. Another problem with the determination of shortest 
path is the incomplete knowledge of the network. Hence, 
it is clear that an alternate method of generating path 
(which need not be shortest) becomes necessary in these 
networks. Our analysis of random walk shows that av- 
erage access time between two nodes goes as 0(n), for 
small-world geometry. It is benificial to consider a net- 
work with random routing or switching, particularly if 
it has small-world properties. Thus the random rout- 
ing emerges to be both, practical and computationally 
cheaper method for large networks. 

In Section II we discuss analytical and numerical re- 
sults for the average access time of the random walk on 
one parameter family of graph ranging from regular case 
to the random case. We introduce an independent step 
approximation which allows us to get analytical expres- 
sions for the average access time. We discuss these results 
in Section III. It is found that the random walk results 
are similar to that of the shortest path results. Thus 
from the nature of the outcome of an experiment it may 
be difficult to conclude whether the spread was random 
or along the shortest path. An important consequence 
of this result can be in the routing and switching in very 
big networks. Random routing is a promising method, 
particularly if the network has small- world properties. 
Section [Fv| summarizes the results. 



II. RANDOM WALK ON SMALL- WORLD 
GRAPHS 

The random walk on a graph is performed as follows: 
We start with a fixed node (say i) and at each step make 
a jump to a node connected to i with uniform probabil- 
ity l/d(i), where d(i) is the degree of the node i, thus 
performing a random walk. Such a random walk gives a 
finite Markov chain Q . One of the most important quan- 
tities of interest in a finite Markov chain is the average 
access time. Let Di j be the average access time, defined 
as the first passage time to the node j if the walk starts 
from the node i. We denote Dj = Dqj. 

We perform the random walks on the family of graphs 
generated by the algorithm given by Watts and Stro- 
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gatz [Q. The prescription gives a one parameter family 
of graphs which interpolates between the regular case and 
the random case. We refer to this family as the family 
of small- world networks or graphs. The regular graph 
(denoted by p — 0) is a graph with n vertices on a circle 
with each node connected to 2k nearest neighbors. The 
parameter k is suitably chosen to keep the graph sparse 
but connected. The other elements of the family are ob- 
tained by random rewiring of each edge in the graph with 
probability p. It is seen that the small- world behavior is 
prominent around the parameter value p = 0.01. i.e., 
when only 1% edges are rewired. The case p = 1 corre- 
sponds to the random case pTf . 



A. The regular case (p = 0) 
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FIG. 1: The plot of average access time vs. the distance m of 
a site from the starting point (m = 0) for the regular graphs. 
Curves (a), (b) and (c) correspond to k = 1, 3 and 5 respec- 
tively. The points are the result of simulations of random 
walks on graph of size 1000. The lines are the analytical and 
scaling results obtained using equations (^) and (|H|). 



Fig. |l| shows the results of the average access time in 
simulation of random walk on the regular graph with 
1000 nodes for several values of k. The average access 
time shows a linear behavior for small to and it shows 
quadratic nature for large m due to the circular topology. 
The lines are the analytic curves obtained as follows. 

From the expectation values of conditional events, we 
can easily write the recursion relation for average access 
time for the walk starting from node i as C3| 
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(i) 



This is a 2fc-th order difference equation for Di m . We 
first consider the case k = 1. In this case the above 
recursion relation reduces to a quadratic form given by 

1 



Di 



r(-Dj+i,m + -D'i-i, 



1 



(2) 



The eq 
niques 



uation can be solved using standard tech- 
10| and the solution is given by 



Di t m = — (to — i) 2 + A (to — i) + B 



(3) 



where A and B are constants. The constants are de- 
termined using the boundary conditions D m m = and 
D m -n, m = 0. Hence, 



Di 



(2m 



mn 



Without loss of generality, we assume that the walk starts 
from i = 0. So the average access time for site to starting 
from zero is 

D m = —to 2 + mn (4) 

Curve (a) in Fig. |l| shows both the analytical and nu- 
merical results for the case k = 1. The linear and the 
quadratic nature is clearly seen. 

For general value of k, it is not possible to solve the 
difference equation (Q) exactly. The numerical simula- 
tions show that the nature of the curves for different k is 
the same as for k = 1. This suggests that there may be 
a scaling relation for a general k. Using numerical data 
fit we find that the following scaling relation fits the data 
reasonably well. 



D k m (n)^(l + f ,Hk))DL(-) 



(5) 



i=i 



where is the average access time from the site to site 
to on the graph with 2k nearest neighbors, n = 0.86 p2| . 

A possible explanation for such a scaling can be a quo- 
tienting of a graph by sub-graph of size k. 

The Fig |^ explains the quotienting procedure clearly. 
The outer graph in Fig || is of size 30 and the inner graph 
is the quotiented graph which has only nearest neighbor 
connections. While the walk on the quotiented graph is 
described by equation (||), the coefficient probably comes 
from the average time spent in each block. Note that the 
number of blocks that the walker has to pass is Jr. 

From Fig. [l] we see that the scaling relation (||) shows 
an excellent matching with the numerical results for var- 
ious values of k. 



B. The random case (p = 1) 

For the completely random case the access time be- 
comes independent of to and Dij = n — 1, V i, j. This 
result can be obtained as follows. 

We note that to calculate average access time one must 
consider an ensemble of graph for a given p. The aver- 
age access time is obtained by first averaging over several 
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FIG. 2: Figure describes the quotienting procedure of reduc- 
ing the regular graph with n nodes and 2k neighbors to a 
graph with £ nodes and 2 neighbors. The outer graph is of 
30 nodes and the inner quotiented graph is of 15 nodes. 
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realizations of random walk on a given graph and then 
it is averaged over various members of the ensemble of 
graphs. We now introduce an independent step approxi- 
mation where we assume that the order of these two av- 
erages is interchanged. Thus in this approximation each 
step of the random walk is averaged over all the real- 
izations of the graphs. This approximation is a kind of 
mean field approximation done in statistical mechanics. 

Let Pij be the probability of reaching site j from site 
i in one step. It is obvious that J^i^i.j = 1- Using 
independent step approximation, we get 



P 



Thus the probability of reaching a site m at any time 
step is ^zj- Hence, the probability of reaching site to for 
the first time in t time steps is 



(6) 



Pm(t) = 

Thus D m is given by 

An = J2 tPm ^ 

t 

= n — 1 



C. The intermediate case (0 < p < 1) 



(7) 



Fig. H shows the results of average access time on the 
graph with 1000 nodes and k = 5. Two distinct behaviors 
of average access time can be identified from the Fig. ||. 
For small values of to, the behavior is similar to that of 
regular case, while for the larger values of to the average 
access time saturates and behaves like that of a random 



FIG. 3: The plot of average access time vs the distance m 
from the starting point m = for graph with 1000 nodes and 
k = 5, generated for various values of parameter p. The av- 
erage access time clearly show the crossover from the regular 
to random behavior with increasing m. Note the logarithmic 
scale for the average access time. 



graph (See Sec. [II B| ). As p increases the saturation of 
average access time becomes more prominent and the 
crossover from regular to saturation behavior takes place 
at smaller and smaller values of to. For p = 0.01, which 
corresponds to the small- world behavior [|l], the average 
access time behavior is almost same as that of the random 
graph except for small values of to of the order of k. 

To obtain analytical estimates of average access time 
we again make use o f the independent step approximation 
defined in Sec. [II B| . In this approximation each step in 
the walk is taken with a probability (1 — p) to nearest 
2k sites and with probability p to the remaining sites. In 
analogy with the p — case we write a recursion relation 
for A,m (See Appendix). 



1 fe 

A,m = (1 ^(A+i,™ + A-;,m) + 1 



1=0 



1 



2k- 



(8) 



, ,i + k 



This is a 2fc-th order difference equation. As in the 
case of p = 0, we first consider the case k = 1. The 
recursion relation becomes 



7i(Di+l,m + A-l,m) + l 
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+P[— S E A.m + 1 

1=0 

i =/= i,i — 1,2 + 1 
which can be written as 

(1 -p)[-(A+l,m + A-l,m) + 1 

71 — 1 

V)A,m + l 7,(D 



(9) 



n — 3 



with little algebra, we finally get 

A,m = £(A+l,m + A— l,m) + C 



+l,m 



(10) 



where 



and 



c 



(n — 3) — p(n — 1) 
2(n-3+p) 



P S"=o Am + n - 3 
n — 3 + p 



Note that the sum ^™T A,m is independent of the 
site index and can be treated as a constant to be deter- 
mined self-consistently from the solution. We solve the 
different equation ( |l(i| ) using the standard methods JhJ. 



D i<m = A9 { ™- 1) + B6 {m - l) 



c 



2£-l 



where 



e± = -(i±vT r 4^) 

The constants A and B are determined using the 
boundary conditions D m ^ m = and D m ^ n m = 0. The 
solution is given by 



A 



C 



2£ - 1 



3 — m a — tl n — m n — n n — ra , n—m 



8v n - 8: 



i id 



where without loss of generality we have put the starting 
point as i = 0. 

The sum y^ m _ n D m occurring in £ is calculated by 

F m and 

(n-3) 



summing Eq. ( pd| ) for all values of m and is given by 

n-l 



An = 



m=0 



P 1 + n l Em=l 



iflflOO 



A-l|m) 



Note that Eq. (0) and further analysis exhibit random 
behavior as p — > (T^pp^ rather than p — 1 (See Ap- 
pendix) . 

For general value of k, we again write a scaling relation 
similar to that of the case p = 0. 

D k m (n,p) « (1 + ij(p, fc) ln(fe))£>L (J,p) (12) 
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FIG. 4: Plot of average access time vs the distance m from 
the starting point of the random walk. The points are the 
numerical results for n = 1000, k — 5 and p — 0.0001 (open 
circle), p = 0.001 (solid circle), p — 0.01 (open triangle) and 
p = 0.1 (solid triangle). The lines are obtained using the 
scaling relation Eq. dl2|). 



The Fig. [| shows the match of the numerical data with 
the analytic expressions. The parameter p has a weak 
dependence on k for non-zero p which we have neglected 
in further analysis. Fig. shows the behavior of p for 
the various values of p. In Fig. |J it is clearly seen that 
for small and large values of p the match numerical and 
equation is quite good but for intermediate values there 
is a considerable deviation from the numerical results for 
small values of m. This fact can be understood as follows: 

For small values of p, i.e., when p » 10 -4 , the graph 
is nearly regular and the blocks of size k are nearly com- 
pletely connected graphs, but as p increases the proba- 
bility of leaving the block randomly to far away point 
increases, giving rise to higher average access time for 
the nearby points in the block, than the analytical val- 
ues. Again for high values of p the expression has a good 
match because as far as the average access time is con- 
cern the completely connected block and random block 
be have in similar way. 



III. DISCUSSION 

It is interesting to compare the limiting behaviors of 
intermediate cases with those of results for the regular 
and random cases. The Eq. (0) reduces to Eq. (§) as p — » 
0. However, the solution (Eq. (pi])) does not smoothly 
reduce to Eq. (||). This is because of the degeneracy in 
the roots of quadratic indicial equation of Eq. (0) for p = 
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FIG. 5: Plot of fi{p) as a function of p. 



which is lifted for non-zero p. Thus the solution changes 
from a polynomial to an exponential in site index as p 
becomes non-zero. This explains the reason behind the 
saturation of the solution for non-zero p for asymptotic 
m which can not be obtained for p = 0. 

For the random case we again find a similar situation. 
As p — > , which corresponds to the random case 

(see Appendix), the Eq. (||) reduces to the D m — ( and is 
consistent with the solution D Q = and D m = n—1, m ^ 
0. However, the solution (Eq. (pi])) does not smoothly 
reduce to a constant function because one of the roots 
(6> + ) diverges. 

Next, we consider the average access time for the dia- 
metrically opposite node i.e., G(p) = D^(p). This quan- 
tity is of interest as it should be correlated to the average 
cover time for the graph. 

Fig. H shows the behavior of the access time of the di- 
ametrically opposite point normalized with that of p = 
case, i.e. G(p)/G(0), as a function of p. The figure also 
shows the graph of characteristic length L(p)/L(0) [Q, 
which is the average shortest path between any two sites. 
Both the curves in the graph corresponding to the short- 
est path and the random walk show similar behavior. 
This observation has interesting consequences. The ran- 
dom spread on a small-world network considerably re- 
duces the access time compared to the regular graph as 
in the case of the shortest path spread. This can be 
very useful in applications such as routing and switching 
where random routing is cheaper. The determination of 
shortest path is generally very expensive (the number of 
operations goes as n 3 ) and also in many cases the com- 
plete information of the network is not available. Ex- 
amples of such networks are social networks, telephone 
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FIG. 6: Plot of G(p)/G(0), i.e., normalized average access 
time for the diametrically opposite point of the graph, vs p. 
The plot also shows characteristic length L(p)/L(0) as a func- 
tion of p. Both the quantities show a similar behavior. In 
particular the sharp drop near small-world regime (p ~ 0.01) 
to the value corresponding to the random case is clearly seen 
in the figure. 



networks, internet etc. where the number of nodes is 
very large. In such cases random routing will be more 
effective and cheaper, particularly if the graph has the 
small-world geometry. In this case the average access 
time goes as 0(n). 

In the calculations we have introduced an independent 
step approximation, where we average over the various 
realizations of graphs at each time step. This is a kind 
of mean field approximation and has allowed us to ob- 
tain recursion relations for the average access time. We 
expect this approximation to be reasonably good for the 
random case. Even for the intermediate cases, the results 
obtained by this approximation are in good agreement 
with the numerical values. 

For general values of k the recursion relations for aver- 
age access time can not be solved. However, the behavior 
of average access time shows an interesting scaling rela- 
tion in k. As discussed in Sec. II A the scaling relation 
corresponds to a quotienting procedure where a graph 
of n nodes with 2k connections is scaled to a graph of 
^ nodes with nearest neighbor connections and preserv- 
ing the far-edges j2j. The scaling gives very good fit to 
the numerical data except for small values of m, when 
p > 0.001. 
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IV. SUMMARY 

In this paper, we have studied random walk on the 
family of small-world graphs. For the regular case the 
average access time shows linear behavior for small dis- 
tances and shows quadratic nature for large distances. 
An interesting scaling relation in k is observed for the 
average access time. For the random case the average 
access time is (n — 1) and is independent of distance and 
k. An independent step approximation has enabled us 
to get the analytical result for the average access time. 
The same approximation allows us to write a recursion 
relation for the intermediate values of p. For intermedi- 
ate cases the average access time shows a crossover from 
regular to random behavior with increasing distances. 

The normalized average access time of the diametri- 
cally opposite nodes shows almost identical behavior as 
that of the characteristic length as a function p. This ob- 
servation can be very important in several applications 
where the number of nodes in graph are very large or 
the complete information about the graph is not avail- 
able. In these cases random routing or switching will be 
beneficial and cheaper. 

We thank Prof. V. Balakrishnan, Prof. G. Baskaran, 
Dr. G. Menon and Dr. R. Ramanujam for fruitful dis- 
cussions. 



V. APPENDIX 

A. Derivation of equation (^) 

We assume that the probability of breaking of an edge 
is p. We identify that the contribution to the average 
access time of site j comes from three types of events. 



1. All the 2k neighbors of site j are connected to j. 
The probability associated with this event is (1 — 
p) 2k . 

2. Some of the neighbors are connected to j (say 
2k — r) and r are connected to the far away 
sites. The probability associated with this event 
is p r {l -p) 2k ~ r . 

3. None of the neighbors of j are connected to j. The 
probability associated with this event is p . 

By independent step approximation the degree of each 
node is 2k. This enables up to write a recursion relation 
for average access time using the properties of expecta- 
tion value of conditional probability . We write 



2k- 



^ 2k C r p r (l-p) 2k ~ T 
r=0 



2k 



2k 



.1=1 



(1 



k 

Ilk ^ 

1=0 



(D 



i-\-l,m 



+ D t 



i—l.m 



l 



D 



2k -1 

I i 



In the above calculation we have assumed that the edge 
broken with probability p is not rewired to one of the 2k 
nearest neighbors. Due to this the random graph limit 



corresponds to p 



(n-2fc-l) 



instead of p = 1. This can 



be seen by equating the probability of a connection to 
a nearest neighbor to the probability of a connecting to 
any of the other sites 



(1-P) 
2k 



P 



2k - 1 
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